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Abstract 

We consider the simplest split-merge Markov operator T on the infinite-dimensional 
simplex Si of monotone non-negative sequences with unit sum. For a sequence x £ Si, 
it picks a size-biased sample (with replacement) of two elements of x; if these elements 
are distinct, it merges them and reorders the sequence, and if the same element is picked 
twice, it splits this element uniformly into two parts and reorders the sequence. We 
prove that the means along the T-trajectory of the <5-measure at the vector (1, 0, 0, . . .) 
converge to the Poisson-Dirichlet distribution PD(l). 
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1 Introduction 

In this paper we investigate the simplest split-merge Markov operator T originally introduced 
by A. Vershik. This operator acts on the infinite-dimensional simplex Si of monotone non- 
negative sequences with unit sum as follows. For x — (xi, X2, ■ ■ ■ ) G Si, consider a size-biased 
sample (with replacement) of two elements of x. If two different elements picked, 
then we merge them into Xi + Xj (and reorder the sequence to obtain a point of Si); and if 
the same element x^ is picked twice, then we split it into two parts t, Xi — t (with t uniformly 
distributed on [0,Xj]) and reorder the sequence. 

Conjecture (A. Vershik) The only T -invariant distribution on Si is the Poisson-Dirichlet 
distribution PD(1). 

The Poisson-Dirichlet distribution PD(1) introduced by J. F. C. Kingman || is perhaps 
the most distinguished distribution on the infinite- dimensional simplex. It arises in many 
various problems from different areas of mathematics and applications and has many re- 
markable properties and characterizations. The conjecture above may be considered as an 
attempt to give a new very important characterization of this measure. 

In the original setting the operator T arises from the representation theory of the infinite 
symmetric group &<x>- Namely, the group ©00 acts by shifts R g on the projective limit ©°° 
of finite symmetric groups (the so-called space of virtual permutations). On the other hand, 
there is a projection from ©°° onto the infinite-dimensional simplex which associates with 
a virtual permutation the sequence of its relative cycle lengths. The simplest split-merge 
operator T under consideration is the projection on Si of the shift -R(i,2) corresponding to 
a transposition in ©oo. Following this approach, N. Tsilevich [7| proved the uniqueness 
of invariant measure for the family of Markov operators {T g } gG Q oc arising from the action of 
the whole group ©oo. We would also like to mention the paper [jj proving the uniqueness of 
invariant measure for a closely related Markov operator. 

Recently, the same operator T appeared as a simplified model in a quite different context 
of triangulations of random Riemannian surfaces (R. Brooks). Inspired by this motivation, 
E. Mayer- Wolf, O. Zeitouni and M. Zerner have proved (by purely analytic methods) the 
uniqueness of T-invariant measure under strong smoothness assumptions on the measure 
(A. Vershik, personal communication). However, the problem for arbitrary Borel distribu- 
tions is still open. 

In this note we prove another partial result. Consider the trajectories of the 5-measures 
5 X , x G Si, under T. We show that for almost all points x with finitely many non-zero 
coordinates (e.g., x — (1, 0, . . . )), the binomial means along the trajectories converge to the 
Poisson-Dirichlet measure PD(1). 

We would like to emphasize that we essentially exploit the relation of this problem to 
symmetric groups and the techniques developped in || [?J . We hope that these methods will 
eventually allow to prove Vershik's conjecture. 

Let us describe the problem more precisely. Let 
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be the simplex of non-increasing non-negative sequences with sum at most one, and Si = 
{iGE: YliLi Xi = 1} be its subsimplex consisting of sequences with sum exactly one. The 
most elementary definition of the Poisson-Dirichlet measure PD(1) is as follows. 

Definition 1 Let C/i, C/2, • - ■ be a sequence of i.i.d. random variables uniformly distributed 
on the interval [0, 1]. Let 

n-l 

V n = U n Hil-Ui). 

1=1 

It is easy to see that Ylili Vi — 1. The Poisson-Dirichlet measure PD(1) on the simplex Si 
is the distribution of the order statistics V(i) ^ V( 2 ) ^ . . . of the sequence Vi, V 2 , 

Now consider the following Markov operator on S x : 

V(xi -\- Xj, X\, . . ., Xi+i, . . ., Xj—i, Xj+\, . . .), i < j, 

with probability 2xiXj] . . 

V (t, Xi t, x±, . . ., Xi—i, Xi+i, ■■■)■, i — 1, 2, . . ., 

with probability a^dt, t G [0, Xj], 

where the operator V arranges the elements of a sequence in non- increasing order. Denote 
by E the identity operator on Si. We may consider these operators also on the space .M(Si) 
of probability Borel measures on Si. 

Let v>o denote the 5-measure at the point (1, 0, 0, ... ) G Si. In this note we prove the 
following theorem. 

Theorem 1 The sequence 

( E + T\ m 

{—) "° 

converges (weakly) as m — > 00 to the Poisson-Dirichlet measure PD{1). 



Corollary The sequence 

converges (weakly) as m — > 00 to the Poisson-Dirichlet measure PD(1) for almost all (with 
respect to the Lebesgue measure) points x G Si with finitely many non-zero coordinates. 

The author is grateful to A. Vershik for suggesting the problem and for many fruitful 
discussions. 
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2 The space of virtual permutations 



In this Section we give a necessary background concerning the space of virtual permutations 
and central measures on this space. 

Definition 2 Given a subset J C [n] and a permutation w G & n , denote by it n ,jw a permu- 
tation of the set J obtained by removing from cycles of w all elements that do not belong to 
J. The permutation Tt nt jw is called the induced permutation for w on J. 

The induced permutation on the subset J = [m] is denoted by n n ^ m w. We shall usually 
omit the index n if it is clear from the context. For example, if w = (6 3 51) (4 2) (7), then 
n 4 w = (31) (4 2). 

Definition 3 The space of virtual permutations G°° is the projective limit &°° = 

hm(5„ of finite symmetric groups & n with respect to canonical projections 7r n +i,n '■ &n+i — ► 

Thus a virtual permutation is a sequence (wi, w 2 , •••) G ©i x (5 2 x . . ., such that 7i n w n+ i = 
w n for all n EN. 

Note that 7r n commutes with shifts on elements of the group & n , i.e. for all N > n, the 
equality TT n (g^ 1 hg 2 ) = g^ 1 Tc n (h)g 2 holds for all h G &n and gi,g 2 G & n . Let 600 = U n ^i& n 
be the infinite symmetric group. Then the group G = Soo x acts on the space of virtual 
permutations &°° as 



A sequence {/i n } of distributions on finite symmetric groups & n is called coherent, if it 
accords with projections vr n+l n , i.e. vr n+l n /i n+1 = fi n for all n£l Each coherent sequence 
of distributions defines a Borel measure \x = lim \i n on the space of virtual permutations (5°°, 
and each Borel measure on @°° can be presented in this form. In what follows all measures 
are assumed to be Borel and normalized. If all measures \L n are central, i.e. invariant under 
inner automorphisms of & n , then the limit distribution \x is invariant under the diagonal 
subgroup K = {(gi,g 2 ) G G : g x = g 2 }. 

Definition 4 K -invariant measures on the space of virtual permutations &°° are called 
central. The set of all central measures is denoted by Ai K (&°°) . 

Example 1. Let m n be the Haar measure on the symmetric group & n . The sequence {m n } 
is coherent, and the measure m = limm n is called the Haar measure on the space of virtual 
permutations. It is clear that this measure is central. Moreover, it is invariant under the 
whole group G. 

The problem of describing all central measures on the space of virtual permutations is 
parallel to Kingman's theory of partition structures (see, e.g., 0, Let us reformulate his 
results in our terms. 
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Each point 16E defines a central measure P x on the space of virtual permutations. For 
x G Si they are described as follows. Let us put elements 1,2,... at random into cycles and 
label at random these cycles according to the following rule: 

• at the first step the element 1 forms a 1-cycle, and we label it by j with probability Xj 

(j = 1,2,...); 

• if the elements 1, . . ., m are already placed, and they form k cycles with labels i%, . . ., ik, 
then the element m + 1 is inserted in one of possible positions in the jth cycle with 
probability Xi. {j = 1, . . ., k), or forms a new 1-cycle labelled by i with probability Xi 
(i ^ zi, . . .,i k ). 

After the nth step of this procedure we obtain a random permutation w n G (5„. It follows 
from construction that the sequence {w n } is coherent, thus it defines a random element of 
and we denote its distribution by P x . 

Example 2. If x = (1 ; 0, . . .), then the projection of the measure r = P x ° onto & n is the 
measure r n uniformly distributed on one-cycle permutations. 

Example 3. The projection of the measure P x , x G Si, on ©2 is given by 

J?((l)(2)) = 2 X *J> 

l^i<j<oo 
00 

P x ((12)) = £4 

i=l 

Theorem 2 ([Q) Let n be a central measure on the space of virtual permutations. Denote 
by li(w n ) ^ hi^n) ^ . . . the cycle lenghts of a permutation w n G & n in non-increasing order. 
Then the limits 

X,H=hm^, i = 1,2 (3) 

called the relative cycle lengths of the virtual permutation u, exist for almost all with respect 
to /j, virtual permutations uo = (wi,W2, . . .). For every x G S ; the conditional distribution of 
io, given (Xi,X2, . . .) = x, equals P x . Thus 

fi = J P x du(x), (4) 

where v is the distribution of the vector X = (Xi, X%, . . .) on E. 

In particular, if the measure fi is saturated, i.e. the sum of relative cycle lengths is equal 
to 1 a.s. with respect to fi, then the corresponding measure v is supported by £1. 

Example 4. The distribution of the relative cycle lengths of virtual permutations with 
respect to the Haar measure is the Poisson-Dirichlet distribution PD(1). 
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Thus there is a one-to-one correspondence p : u i— > p between Borel distributions u on 
E and central distributions p on 6°°. Given i/, the corresponding central measure p = p(p) 
is recovered via (Q). And given /i, the corresponding measure z/ = p~ l (p) on E is the 
distribution of the relative cycle lengths with respect to p. 

Remark. The space of virtual permutations (5 00 was introduced by S. Kerov, G. Olshanski 
and A. Vershik 0. They considered a family of quasi- invariant distributions on (3°° with 
respect to the action of the group G = &oo x and studied the associated family of unitary 
representations of the infinite symmetric group. 

3 Reduction lemmas 

The infinite symmetric group &oo acts on the space of virtual permutations by two-sided 
shifts (D. Denote by R g : 6°° -> 6°° the right shift on g £ G^. 

One can easily check that the image p 9 = R g p of a central measure p enjoys the property 
that the normalized cycle lengths exist for almost all with respect to p 9 virtual permutations. 
Hence we may define the projection of R g on E, which is already a Markov operator. It 
sends a point x £ E to p~ l (ug), where to is a random virtual permutation with distribution 
P x . Thus the corresponding operator T g on the space M(E) of probability measures on E 
completes the commutative diagram 

M K {&°°) — M(6°°) 
p 

M(E) -^-> M(E). 

One can easily see that the operator T g depends in fact only on the conjugacy class of g, 
i.e. on its cycle structure. 

The subsimplex E a = {x £ E : ^ = a} is T s -invariant for all a £ [0, 1] and all 
g £ 600 (since the shift by a finite permutation does not change the sum of the relative cycle 
lengths of a virtual permutation). It is proved in Jj]] that these sets are ergodic components, 
and the ergodic measure concentrated on E a is the image of PD(1) under the homothetic 
transformation r a : Ei — > E a . In particular, 

Theorem 3(0) The only measure on Ei which is invariant under the family {Tg}^^ is 
the Poisson-Dirichlet distribution PD(1). 

The relation of this theorem to our problem is given by the following lemma (note that 
T e is obviously the identity operator, and we may consider the identity operator E on E x as 
the restriction of T e on Ei). 

Lemma 1 The Markov operator T on Ei defined by (jl[) is the restriction on Ei of the 
operator T(i 2) (= T g , where g is an arbitrary transposition in &oo)- 
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Proof. Follows from the definition of T g and Example 3. 

Now we cite some preliminary lemmas from [[F] which we shall use in the sequel. First, 
we reformulate the problem in terms of the space of virtual permutations. Since there is a 
one-to-one correspondence between Borel measures on E and central measures on (5°°, one 
may think of the operator T g as acting on the space M K (&°°). More exactly, denote by T g 
the operator on A / l K (&°°), corresponding to T g , i.e., the image of T g under the map p. 

Lemma 2(0) Let p e M(&°°). Then f gi i = PR g fi, where P : M(6°°) -> M K (6°°) is 
the projection (conditional expectation) onto the space of central measures. 

In particular, let T = T(i,2) and E = T e (the identity operator on Then our 

Theorem |l| is equivalent to the following proposition. Let r = lim r n = P x ° be the central 
measure on & 00 corresponding to the vector Xq — (1, 0, 0, . . .) G £1 (see Example 3). 

q 



Proposition 1 Let r q = (^f^ t. Then 



lim r q = m, 

q^oo 

where m is the Haar measure on the space of virtual permutations. 

The ergodic method allows one to express the operator T g in the following form. 
Lemma 3(0) Let p be a central measure on the space of virtual permutations. Then 

1 



A -yoN\ ^ 



T g fi = lim — > u {R g n)u. (5) 



Finally, the formula (|5|) may be restated in terms of distributions on finite symmetric 
groups. 

Lemma 4(0) Let n be a central measure on the space of virtual permutations, and p n be 
the corresponding coherent family of measures on finite symmetric groups. Then the finite- 
dimensional projections of the measure T g fi are given by the following formula, 

(2>)„(u) = J2 Vn+k(wg n ) Vn, k e N, Vg e &\ \/u e 6 n , (6) 

™ee n+fe 

where & k = &[n + 1, . . ., n + k] C & n +k- 
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4 Shifted projection of irreducible characters of sym- 
metric groups 

In this section we present a formula obtained in M which describes the action of the operator 
T g on irreducible characters x\ of the symmetric group & n +\. It generalizes a known formula 
for the action on xx of the canonical projection 7r n . 

Fix n, k £ N, N = n + k and g G & k = &[n + 1, . . ., n + k]. Define a projection 

n N,n '■ ®N -> ©n aS 

Then the formula (Q) from Lemma ^ may be formulated as follows: 

(Tgfl) n = K 9 Nn LL N 

for all n,k £ N, N = n + k, g e & k . We may consider the central measure fi n as a central 
function on the symmetric group & n . Since the characters of irreducible representations 
form a basis in the space of central functions, it is interesting to consider the action of n g N n 
on the irreducible characters. Recall that the irreducible representations of the symmetric 
group & n are indexed by partitions A = (Ai ^ A2 ^ . . . ) of the number n (which we identify 
with the corresponding Young diagrams). Denote by Il n the set of all partitions of n. Thus 
we are interested in a formula for 

n 9 N ,nX\(u) = Xx(wg), u G 6„. 

wee N 

Remark. If g = e this is the action of the canonical projection 7r n on the irreducible 
characters. As follows from the arguments of |5|, Sect. 4.6], 

TTnXA = X! ( C ( A \ V) + !) A G n n+1 , (7) 

where /i /* A means that the diagram fi G Il ra is obtained from A by removing one cell. 

We recall some definitions related to Young diagrams. The Young diagram D\ of the 
partition A G IT n is the set {(i, j) : i, j G Z, i ^ 1, Aj ^ j ^ 1}. If x — G D\, then x is 
called a cell of the diagram D\. The content of a cell 2 = is the number c(x) = j — i. 
Let m < n, X e IT n , /x G Il m . We write A D /x if Aj ^ Hi for all i = 1,2,..., that is, the 
Young diagram of A contains the Young diagram of /i. The set-theoretic difference of these 
diagrams is called a skew diagram. A skew diagram is called a skew hook, if it is connected 
and does not contain two cells in one diagonal. If a skew hook contains k cells, it is called 
a skew k-hook. Denote by the set of skew fc-hooks. The height of a skew hook A \ \x is 
the number /(A \ \i) equal to the number of rows it intersects minus one. A skew diagram is 
called a horizontal m-strip, if it contains m cells and has at most one cell in each row. 
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Proposition 2 ([0) Let xx be the character of the irreducible representation of the sym- 
metric group & n corresponding to the partition A £ Il y, and let g £ & k be a permutation 
consisting of one cycle of length k. Then 

<,nXA= £ ("l) i(AV) J! (^H 1 )*^ (8) 



Corollary Le£ be the character of the irreducible representation of the symmetric group 
& n corresponding to the partition A £ H n +k, and let g £ & k be a permutation with cycle 
structure v = {i>\, . . ., v s ) £ 11^. Then 

A,nXx = E(" 1 ) HM) II ( C ^) + VXv ( 9 ) 
m xex\fj, 

where the sum is taken over all s-tuples M = . . .,/x s } such that A = fio D fi± D . . . D 
li s £ II n; /ij_x \ fii is a skew Vi-hook for all % — 1, . . ., s ; and Z(M) = J^* =1 \ Hi)- 

5 Proof of Proposition J] 

Let iV = n + 2m, and A £ 11^. 



Lemma 5 



±{E + T) rn xx = X, ■ ( II Wx) + 1) J • p(A, /*), (10) 

where p(X, //) zs i/ie number of all sequences M = {/i = /i, . . ., /i m = A} st/c/i t/icrf 
/ifc £ U n+ 2k, and fik+i \ fJ>k is a horizontal 2-strip for all k = 0, . . . , m — 1. 



Proof. Note that 



7r n+2,n + 7r „+2,n \ / 7r n+4,n+2 + 7r n+4,n+2 \ / n N,N-2 + ^N,N-2 



Xa, 



where = ^1+21 ■ Then it follows from Proposition § that the projection of ^(E + 

T) m XA onto <5 n equals 

e ^ • ( n ( c w+ 1 )) -E a ( M )' 

A*Gn„ \a:eA\/i / M 
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where the sum is taken over all sequences M = {/x = fi, /xi, • • •, /-t m = A} such that C /ifc+i, 
fj, k e n n+2 fc, and a(M) = n™=i "(^fc \ A^fc-i), where a(-) = Ql (')+" 2 (-) with 



Ql(A \ V) 



1, if A \ v is a 2-row or a 2-column, 

2, otherwise. 



1, if A \ v is a 2-row, 
ol%{\ \ v) = ^ — 1, if A \ i/ is a 2-column, 
0, otherwise, 



that is, 

1, if A \ v is a horizontal 2-strip, 
0, otherwise (that is, if A \ v is a 2-column). 



a(A \ v) 



The Lemma follows. 

Now each central measure on ©at can be decomposed into a combination of irreducible 
characters. In particular, let 

T N(g) = ^2 aA *A(»- 

Aeriiv 

It is easy to see that the coefficients of this decomposition equal 

a A = ^ XA ((i2...iV)). 



hook partitions \^ = [kl N k ), for k = 1, . . ., N, and o^iat-*) = - 1 L — . Thus 



It follows from the Murnaghan-Nakayama rule that these coefficients are not zero only for 

(_l)iV- 

l (ki N ~ k ) — — nT 

x 



k=l 



Now we are ready to prove Proposition [l|. 

First let n = 2. Denote c(A,/i) = Y\ xe x\u( c ( x ) + -Q- Then the projection r| of r q on (5 2 
can be written as a q X(2) + b q X(i 2 ), where 



«, = E^vr • c (^(2))-KAr,(2)), 

fe=l 



fe 9 = E^vr- c ( A ^( l2 ))-^( 12 ))- 

fe=i 

It is clear that the coefficient p(\jl, (2)) is not zero only for k = N, and c((N), (2)) = 
3 • 4 • . . . • N = N\/2. Thus we obtain that a q = 1/2. The corresponding term \x{2) = nil is 
just the Haar measure on ©2. Thus it suffices to show that b q — > as q — > 00. 
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Let us calculate the coefficient p(Aj^, (l 2 )). It follows from Lemma [5] that we must calcu- 
late the number of paths from (l 2 ) to (kl N ~ k ) such that at each step we add either two cells 
in one row or two cells in different rows and different columns. It is easy to see that this num- 
ber equals (^Jj^J. The content coefficient equals c((kl N ~ k ), (l 2 )) = {-l) k+1 k\(N - k - 1)!, 
thus the summand in b q corresponding to A^ equals 

q\k\(N- k- 1)! k\q\ 



N\(N - k - l)\(k - q - 1)! N\(k-q-l)\' 

Hence, 

q}_ ^ k\ q\(q+l)\ ^ k\ 



q\(q + 1)1 ^ ( k \ q\(q + 1)\ f2q + 2 



E 



N\ ^ \q + lj N\ \q + 2 J q + 2 

k=g+l v* / \ i / i 

as q — > oo, as desired. 

The proof for n > 2 goes in the same way with a bit complicated calculations. It is clear 
that the decomposition of contains only hook characters, that is, 



T n = 5^Off(0X(ll»- 



1=1 

and 



N 



fc=i 



If / = n, there is again only one non-zero summand corresponding to k = N, 

1 JV! 1 
9V 7 AH n! n\ 

and a g (n)x( n ) = m n , that is, the Haar measure on & n . Thus we only have to prove that 
a q (l) — > as q — > oo for I < n. Easy calculations show that p(X^,Xf) = ( 2q ^ k+l ), and 

cWA?) = (-l) l - k ^§ ■ Thus 

(-l) l+N q]_ ^ k\(N- k- 1)! 



l\{n-l-l)\ N\ k j^ +i (2q-k + l)\(k-l-q)\ 
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But { Z~k-~n+iy. ^{N-k- l)"-*- 1 < (g + n - I - l)"-'" 1 . Then 

KCOI < const ^ I 

k=q+l 



(g + n-Z-l^-'-VCff + O 1 /2g + Z + l\ . 

^ const • — • , , < 

AH \q+l+l 

(q + n-l-l) n - 1 - 1 (2q + l + l)\ ^ 

^ const • ■ — -, -— ^ 

q + l + 1 {2q + n)\ 

1 fq + n-l-l\ n - 1 - 1 
< const • — > 



q + l + 1 V 2g + Z + 2 

as q — ► oo. Proposition follows. 
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